Multilevel Rendering of Document Images
نویسنده
چکیده
Rendering document images for scanning and printing applications typically involves binarization via adaptive thresholding, halftoning, or color dropout. While bitonal rendering is often adequate, there are cases where multi-level rendering is required to capture important image characteristics. In this paper, we present two methods for multilevel rendering of document images. The first method involves adaptive multilevel thresholding of gray scale images based on background tracking. The second method presents color form dropout using color quantization. Both methods are based on a computationally efficient version of the K-means algorithm. The selection of thresholding, halftoning or color dropout depends on the document type and can be applied to the whole image or to various image regions, as determined by a document categorization and segmentation module. Key-Words: Multilevel rendering, color dropout, adaptive thresholding.
منابع مشابه
Multilevel modelling and rendering of architectural scenes
We present a novel approach for multilevel modelling and rendering of architectural scenes using a small set of photographs. Our approach is based on interactive probing of intuitive measures like lengths, widths and heights from single view metrology. These measures can then be aggregated with additional input if need be, for defining high-level primitives such as planes, prismatic blocks, cub...
متن کاملرفع اعوجاج هندسی متون بهکمک اطلاعات هندسی خطوط متن
Document images produced by scanners or digital cameras usually have photometric and geometric distortions. If either of these effects distorts document, recognition of words from such a document image using OCR is subject to errors. In this paper we propose a novel approach to significantly remove geometric distortion from document images. In this method first we extract document lines from do...
متن کاملDocument Analysis And Classification Based On Passing Window
In this paper we present Document analysis and classification system to segment and classify contents of Arabic document images. This system includes preprocessing, document segmentation, feature extraction and document classification. A document image is enhanced in the preprocessing by removing noise, binarization, and detecting and correcting image skew. In document segmentation, an algorith...
متن کاملLearning Document Image Features With SqueezeNet Convolutional Neural Network
The classification of various document images is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state of the art model for this task. However, there are two major drawbacks for these classifiers: the huge computational power demand for...
متن کاملDocument Image Dewarping Based on Text Line Detection and Surface Modeling (RESEARCH NOTE)
Document images produced by scanner or digital camera, usually suffer from geometric and photometric distortions. Both of them deteriorate the performance of OCR systems. In this paper, we present a novel method to compensate for undesirable geometric distortions aiming to improve OCR results. Our methodology is based on finding text lines by dynamic local connectivity map and then applying a l...
متن کامل